by Andrew Trask
In [1]:
def pretty_print_review_and_label(i):
print(labels[i] + "\t:\t" + reviews[i][:80] + "...")
g = open('reviews.txt','r') # What we know!
reviews = list(map(lambda x:x[:-1],g.readlines()))
g.close()
g = open('labels.txt','r') # What we WANT to know!
labels = list(map(lambda x:x[:-1].upper(),g.readlines()))
g.close()
In [2]:
len(reviews)
Out[2]:
25000
In [5]:
reviews[1]
Out[5]:
'story of a man who has unnatural feelings for a pig . starts out with a opening scene that is a terrific example of absurd comedy . a formal orchestra audience is turned into an insane violent mob by the crazy chantings of it s singers . unfortunately it stays absurd the whole time with no general narrative eventually making it just too off putting . even those from the era should be turned off . the cryptic dialogue would make shakespeare seem easy to a third grader . on a technical level it s better than you might think with some good cinematography by future great vilmos zsigmond . future stars sally kirkland and frederic forrest can be seen briefly . '
In [6]:
labels[1]
Out[6]:
'NEGATIVE'
In [7]:
print("labels.txt \t : \t reviews.txt\n")
pretty_print_review_and_label(2137)
pretty_print_review_and_label(12816)
pretty_print_review_and_label(6267)
pretty_print_review_and_label(21934)
pretty_print_review_and_label(5297)
pretty_print_review_and_label(4998)
labels.txt : reviews.txt
NEGATIVE : this movie is terrible but it has some good effects . ...
POSITIVE : adrian pasdar is excellent is this film . he makes a fascinating woman . ...
NEGATIVE : comment this movie is impossible . is terrible very improbable bad interpretat...
POSITIVE : excellent episode movie ala pulp fiction . days suicides . it doesnt get more...
NEGATIVE : if you haven t seen this it s terrible . it is pure trash . i saw this about ...
POSITIVE : this schiffer guy is a real genius the movie is of excellent quality and both e...
In [11]:
import random
for i in range(100):
pretty_print_review_and_label(i+random.randint(10, 100))
NEGATIVE : the storyline was okay . akshay kumar was good as always and that was the only g...
POSITIVE : the ship may have sunk but the movie didn t director james cameron from t...
NEGATIVE : this film is mediocre at best . angie harmon is as funny as a bag of hammers . h...
NEGATIVE : i would have liked to write about the story but there wasn t any . i would hav...
POSITIVE : what s inexplicable firstly the hatred towards this movie . it may not be the...
POSITIVE : to all the miserable people who have done everything from complain about the dia...
POSITIVE : sure titanic was a good movie the first time you see it but you really should...
POSITIVE : the night listener held my attention with robin williams shining as a new york ...
POSITIVE : titanic has to be one of my all time favorite movies . it has its problems wha...
NEGATIVE : this film is about a male escort getting involved in a murder investigation that...
POSITIVE : the night listener is probably not one of william s best roles but he makes a ...
NEGATIVE : i saw this movie at a drive in in . until howard the duck i considered th...
NEGATIVE : such a long awaited movie . . but it has disappointed me and my friends who had ...
POSITIVE : there s so many things to fall for in aro tolbukhin . en la mente del asesino ...
POSITIVE : i avoided watching this film for the longest time . long before it was even rele...
NEGATIVE : shame on yash raj films and aditya chopra who seems to have lost their intellige...
NEGATIVE : this movie is horrible . the acting is a waste basket . no crying no action ho...
POSITIVE : titanic has to be one of my all time favorite movies . it has its problems wha...
POSITIVE : daniell steel s daddy what a refreshing story . this movie glorified the impor...
NEGATIVE : ok i am not japanese . i do know a little about japanese culture and a little ...
NEGATIVE : first lesson that some film makers particularly those inspired by hollywood ne...
POSITIVE : after seeing several movies of villaronga i had a pretty clear opinion about hi...
POSITIVE : somewhat funny and well paced action thriller that has jamie foxx as a hapless ...
POSITIVE : family problems abound in real life and that is what this movie is about . love ...
NEGATIVE : a little girl s dead body is found stripped of all possible means of identifica...
NEGATIVE : absolutely awful movie . utter waste of time . br br background music is s...
NEGATIVE : i just saw the movie in theater . the movie has very few good points to talk abo...
POSITIVE : titanic has to be one of my all time favorite movies . it has its problems wha...
POSITIVE : family problems abound in real life and that is what this movie is about . love ...
NEGATIVE : masters of horror the screwfly solution starts as america is being infected by a...
POSITIVE : for me personally this film goes down in my top four of all time . no exceptions...
NEGATIVE : the worst movie i have seen since tera jadoo chal gaya . there is no story no h...
NEGATIVE : i was fascinated as to how truly bad this movie was . was the viewer supposed to...
NEGATIVE : i would have liked to write about the story but there wasn t any . i would hav...
POSITIVE : titanic is a long but well made tragic adventure love story that takes place dur...
POSITIVE : every once in a while the conversation will turn to favorite movies . i ll me...
NEGATIVE : i m not alone in admiring the first superman movie a film that richard donner ...
NEGATIVE : jewish newspaper reporter justin timberlake as joshua josh pollack is puzzle...
NEGATIVE : the author sets out on a journey of discovery of his roots in the southern t...
POSITIVE : after a brief prologue showing a masked man stalking and then slashing the throa...
POSITIVE : wow . what a wonderful film . the script is nearly perfect it appears this is th...
NEGATIVE : masters of horror the screwfly solution starts as america is being infected by a...
POSITIVE : one of the most heart warming foreign films i ve ever seen . br br the y...
NEGATIVE : warner bros . made many potboilers in the s and most of them are fast paced ...
NEGATIVE : i ve now written reviews for several of the moh episodes and this is among the...
POSITIVE : this movie re wrote film history in every way . no one cares what anyone thinks...
POSITIVE : another aussie masterpiece this delves into the world of the unknown and the su...
POSITIVE : back in do i remember that year clinton bans cloning research the unfortun...
POSITIVE : family problems abound in real life and that is what this movie is about . love ...
NEGATIVE : the curse of frankenstein sticks faithfully to mary shelley s story for one ...
NEGATIVE : i would have liked to write about the story but there wasn t any . i would hav...
NEGATIVE : i grew up on the superman ii theatrical version s t and as a kid i loved...
POSITIVE : to all the miserable people who have done everything from complain about the dia...
POSITIVE : titanic is a classic . i was really surprised that this movie didn t have a sol...
POSITIVE : every once in a while the conversation will turn to favorite movies . i ll me...
NEGATIVE : this is the biggest flop of . i don know what director has is his mind of cr...
POSITIVE : just two comments . . . . seven years apart hardly evidence of the film s rele...
POSITIVE : this movie was excellent . a sad truth to how culture tends to clash with the se...
NEGATIVE : i grew up on the superman ii theatrical version s t and as a kid i loved...
NEGATIVE : tashan the title itself explains the nature of the movie . br br this typ...
POSITIVE : i must admit when i read the description of the genre on netflix as steamy rom...
NEGATIVE : this movie is horrible . the acting is a waste basket . no crying no action ho...
NEGATIVE : so . . . we get so see added footage of brando . . . interesting but not exactly...
NEGATIVE : jewish newspaper reporter justin timberlake as joshua josh pollack is puzzle...
NEGATIVE : all the world said that the film tashan would be a good movie with great pleasur...
POSITIVE : i must admit when i read the description of the genre on netflix as steamy rom...
NEGATIVE : the author sets out on a journey of discovery of his roots in the southern t...
NEGATIVE : warner bros . made many potboilers in the s and most of them are fast paced ...
NEGATIVE : i m not alone in admiring the first superman movie a film that richard donner ...
NEGATIVE : the author sets out on a journey of discovery of his roots in the southern t...
POSITIVE : this movie is worth seeing for the visual beauty and moving acting alone but th...
NEGATIVE : the sight of kareena kapoor in a two piece bikini is about the only thing that ...
NEGATIVE : all the world said that the film tashan would be a good movie with great pleasur...
NEGATIVE : i wasn t terribly impressed with dante s st season offering in homecoming ...
NEGATIVE : this movie is horrible . the acting is a waste basket . no crying no action ho...
POSITIVE : back in do i remember that year clinton bans cloning research the unfortun...
POSITIVE : wang bianlian is an old street performer who is known as a king of masks for h...
NEGATIVE : this is the biggest flop of . i don know what director has is his mind of cr...
NEGATIVE : i found it very very difficulty to watch this after the initial minutes of the ...
NEGATIVE : well at least my theater group did lol . so of course i remember watching grea...
POSITIVE : wang bianlian is an old street performer who is known as a king of masks for h...
NEGATIVE : i grew up on the superman ii theatrical version s t and as a kid i loved...
NEGATIVE : this is an awful film . yea the girls are pretty but its not very good . the plo...
POSITIVE : wow . what a wonderful film . the script is nearly perfect it appears this is th...
POSITIVE : family problems abound in real life and that is what this movie is about . love ...
POSITIVE : the filming is pleasant and the environment is keenly realistic . i liked that i...
NEGATIVE : this is an awful film . yea the girls are pretty but its not very good . the plo...
NEGATIVE : i grew up on the superman ii theatrical version s t and as a kid i loved...
POSITIVE : the filming is pleasant and the environment is keenly realistic . i liked that i...
NEGATIVE : i laughed all the way through this rotten movie . it s so unbelievable . a woma...
NEGATIVE : i gave this stars because it has a lot of interesting themes many here have alr...
POSITIVE : an unexpected pleasure as i had heard nothing about this film . br br sham...
NEGATIVE : warner bros . made many potboilers in the s and most of them are fast paced ...
NEGATIVE : i gave this stars because it has a lot of interesting themes many here have alr...
NEGATIVE : the author sets out on a journey of discovery of his roots in the southern t...
POSITIVE : as i was watching this film on video last night i kept getting these tingles th...
NEGATIVE : this movie was okay but it certainly defeats the claim that homosexuals are bo...
POSITIVE : wang bianlian is an old street performer who is known as a king of masks for h...
NEGATIVE : the curse of frankenstein sticks faithfully to mary shelley s story for one ...
NEGATIVE : i saw this film at its premier at sundance . br br since american beauty...
In [12]:
from collections import Counter
import numpy as np
In [13]:
positive_counts = Counter()
negative_counts = Counter()
total_counts = Counter()
In [15]:
for i in range(len(reviews)):
if(labels[i] == 'POSITIVE'):
for word in reviews[i].split(" "):
positive_counts[word] += 1
else:
for word in reviews[i].split(" "):
negative_counts[word] += 1
total_counts[word] += 1
In [17]:
positive_counts.most_common()
Out[17]:
[('', 550468),
('the', 173324),
('.', 159654),
('and', 89722),
('a', 83688),
('of', 76855),
('to', 66746),
('is', 57245),
('in', 50215),
('br', 49235),
('it', 48025),
('i', 40743),
('that', 35630),
('this', 35080),
('s', 33815),
('as', 26308),
('with', 23247),
('for', 22416),
('was', 21917),
('film', 20937),
('but', 20822),
('movie', 19074),
('his', 17227),
('on', 17008),
('you', 16681),
('he', 16282),
('are', 14807),
('not', 14272),
('t', 13720),
('one', 13655),
('have', 12587),
('be', 12416),
('by', 11997),
('all', 11942),
('who', 11464),
('an', 11294),
('at', 11234),
('from', 10767),
('her', 10474),
('they', 9895),
('has', 9186),
('so', 9154),
('like', 9038),
('about', 8313),
('very', 8305),
('out', 8134),
('there', 8057),
('she', 7779),
('what', 7737),
('or', 7732),
('good', 7720),
('more', 7521),
('when', 7456),
('some', 7441),
('if', 7285),
('just', 7152),
('can', 7001),
('story', 6780),
('time', 6515),
('my', 6488),
('great', 6419),
('well', 6405),
('up', 6321),
('which', 6267),
('their', 6107),
('see', 6026),
('also', 5550),
('we', 5531),
('really', 5476),
('would', 5400),
('will', 5218),
('me', 5167),
('had', 5148),
('only', 5137),
('him', 5018),
('even', 4964),
('most', 4864),
('other', 4858),
('were', 4782),
('first', 4755),
('than', 4736),
('much', 4685),
('its', 4622),
('no', 4574),
('into', 4544),
('people', 4479),
('best', 4319),
('love', 4301),
('get', 4272),
('how', 4213),
('life', 4199),
('been', 4189),
('because', 4079),
('way', 4036),
('do', 3941),
('made', 3823),
('films', 3813),
('them', 3805),
('after', 3800),
('many', 3766),
('two', 3733),
('too', 3659),
('think', 3655),
('movies', 3586),
('characters', 3560),
('character', 3514),
('don', 3468),
('man', 3460),
('show', 3432),
('watch', 3424),
('seen', 3414),
('then', 3358),
('little', 3341),
('still', 3340),
('make', 3303),
('could', 3237),
('never', 3226),
('being', 3217),
('where', 3173),
('does', 3069),
('over', 3017),
('any', 3002),
('while', 2899),
('know', 2833),
('did', 2790),
('years', 2758),
('here', 2740),
('ever', 2734),
('end', 2696),
('these', 2694),
('such', 2590),
('real', 2568),
('scene', 2567),
('back', 2547),
('those', 2485),
('though', 2475),
('off', 2463),
('new', 2458),
('your', 2453),
('go', 2440),
('acting', 2437),
('plot', 2432),
('world', 2429),
('scenes', 2427),
('say', 2414),
('through', 2409),
('makes', 2390),
('better', 2381),
('now', 2368),
('work', 2346),
('young', 2343),
('old', 2311),
('ve', 2307),
('find', 2272),
('both', 2248),
('before', 2177),
('us', 2162),
('again', 2158),
('series', 2153),
('quite', 2143),
('something', 2135),
('cast', 2133),
('should', 2121),
('part', 2098),
('always', 2088),
('lot', 2087),
('another', 2075),
('actors', 2047),
('director', 2040),
('family', 2032),
('between', 2016),
('own', 2016),
('m', 1998),
('may', 1997),
('same', 1972),
('role', 1967),
('watching', 1966),
('every', 1954),
('funny', 1953),
('doesn', 1935),
('performance', 1928),
('few', 1918),
('bad', 1907),
('look', 1900),
('re', 1884),
('why', 1855),
('things', 1849),
('times', 1832),
('big', 1815),
('however', 1795),
('actually', 1790),
('action', 1789),
('going', 1783),
('bit', 1757),
('comedy', 1742),
('down', 1740),
('music', 1738),
('must', 1728),
('take', 1709),
('saw', 1692),
('long', 1690),
('right', 1688),
('fun', 1686),
('fact', 1684),
('excellent', 1683),
('around', 1674),
('didn', 1672),
('without', 1671),
('thing', 1662),
('thought', 1639),
('got', 1635),
('each', 1630),
('day', 1614),
('feel', 1597),
('seems', 1596),
('come', 1594),
('done', 1586),
('beautiful', 1580),
('especially', 1572),
('played', 1571),
('almost', 1566),
('want', 1562),
('yet', 1556),
('give', 1553),
('pretty', 1549),
('last', 1543),
('since', 1519),
('different', 1504),
('although', 1501),
('gets', 1490),
('true', 1487),
('interesting', 1481),
('job', 1470),
('enough', 1455),
('our', 1454),
('shows', 1447),
('horror', 1441),
('woman', 1439),
('tv', 1400),
('probably', 1398),
('father', 1395),
('original', 1393),
('girl', 1390),
('point', 1379),
('plays', 1378),
('wonderful', 1372),
('far', 1358),
('course', 1358),
('john', 1350),
('rather', 1340),
('isn', 1328),
('ll', 1326),
('later', 1324),
('dvd', 1324),
('whole', 1310),
('war', 1310),
('d', 1307),
('found', 1306),
('away', 1306),
('screen', 1305),
('nothing', 1300),
('year', 1297),
('once', 1296),
('hard', 1294),
('together', 1280),
('set', 1277),
('am', 1277),
('having', 1266),
('making', 1265),
('place', 1263),
('might', 1260),
('comes', 1260),
('sure', 1253),
('american', 1248),
('play', 1245),
('kind', 1244),
('perfect', 1242),
('takes', 1242),
('performances', 1237),
('himself', 1230),
('worth', 1221),
('everyone', 1221),
('anyone', 1214),
('actor', 1203),
('three', 1201),
('wife', 1196),
('classic', 1192),
('goes', 1186),
('ending', 1178),
('version', 1168),
('star', 1149),
('enjoy', 1146),
('book', 1142),
('nice', 1132),
('everything', 1128),
('during', 1124),
('put', 1118),
('seeing', 1111),
('least', 1102),
('house', 1100),
('high', 1095),
('watched', 1094),
('loved', 1087),
('men', 1087),
('night', 1082),
('anything', 1075),
('believe', 1071),
('guy', 1071),
('top', 1063),
('amazing', 1058),
('hollywood', 1056),
('looking', 1053),
('main', 1044),
('definitely', 1043),
('gives', 1031),
('home', 1029),
('seem', 1028),
('episode', 1023),
('audience', 1020),
('sense', 1020),
('truly', 1017),
('special', 1011),
('second', 1009),
('short', 1009),
('fan', 1009),
('mind', 1005),
('human', 1001),
('recommend', 999),
('full', 996),
('black', 995),
('help', 991),
('along', 989),
('trying', 987),
('small', 986),
('death', 985),
('friends', 981),
('remember', 974),
('often', 970),
('said', 966),
('favorite', 962),
('heart', 959),
('early', 957),
('left', 956),
('until', 955),
('script', 954),
('let', 954),
('maybe', 937),
('today', 936),
('live', 934),
('less', 934),
('moments', 933),
('others', 929),
('brilliant', 926),
('shot', 925),
('liked', 923),
('become', 916),
('won', 915),
('used', 910),
('style', 907),
('mother', 895),
('lives', 894),
('came', 893),
('stars', 890),
('cinema', 889),
('looks', 885),
('perhaps', 884),
('read', 882),
('enjoyed', 879),
('boy', 875),
('drama', 873),
('highly', 871),
('given', 870),
('playing', 867),
('use', 864),
('next', 859),
('women', 858),
('fine', 857),
('effects', 856),
('kids', 854),
('entertaining', 853),
('need', 852),
('line', 850),
('works', 848),
('someone', 847),
('mr', 836),
('simply', 835),
('picture', 833),
('children', 833),
('face', 831),
('keep', 831),
('friend', 831),
('dark', 830),
('overall', 828),
('certainly', 828),
('minutes', 827),
('wasn', 824),
('history', 822),
('finally', 820),
('couple', 816),
('against', 815),
('son', 809),
('understand', 808),
('lost', 807),
('michael', 805),
('else', 801),
('throughout', 798),
('fans', 797),
('city', 792),
('reason', 789),
('written', 787),
('production', 787),
('several', 784),
('school', 783),
('based', 781),
('rest', 781),
('try', 780),
('dead', 776),
('hope', 775),
('strong', 768),
('white', 765),
('tell', 759),
('itself', 758),
('half', 753),
('person', 749),
('sometimes', 746),
('past', 744),
('start', 744),
('genre', 743),
('beginning', 739),
('final', 739),
('town', 738),
('art', 734),
('humor', 732),
('game', 732),
('yes', 731),
('idea', 731),
('late', 730),
('becomes', 729),
('despite', 729),
('able', 726),
('case', 726),
('money', 723),
('child', 721),
('completely', 721),
('side', 719),
('camera', 716),
('getting', 714),
('instead', 712),
('soon', 702),
('under', 700),
('viewer', 699),
('age', 697),
('days', 696),
('stories', 696),
('felt', 694),
('simple', 694),
('roles', 693),
('video', 688),
('name', 683),
('either', 683),
('doing', 677),
('turns', 674),
('wants', 671),
('close', 671),
('title', 669),
('wrong', 668),
('went', 666),
('james', 665),
('evil', 659),
('budget', 657),
('episodes', 657),
('relationship', 655),
('fantastic', 653),
('piece', 653),
('david', 651),
('turn', 648),
('murder', 646),
('parts', 645),
('brother', 644),
('absolutely', 643),
('head', 643),
('experience', 642),
('eyes', 641),
('sex', 638),
('direction', 637),
('called', 637),
('directed', 636),
('lines', 634),
('behind', 633),
('sort', 632),
('actress', 631),
('lead', 630),
('oscar', 628),
('including', 627),
('example', 627),
('known', 625),
('musical', 625),
('chance', 621),
('score', 620),
('already', 619),
('feeling', 619),
('hit', 619),
('voice', 615),
('moment', 612),
('living', 612),
('low', 610),
('supporting', 610),
('ago', 609),
('themselves', 608),
('reality', 605),
('hilarious', 605),
('jack', 604),
('told', 603),
('hand', 601),
('quality', 600),
('moving', 600),
('dialogue', 600),
('song', 599),
('happy', 599),
('matter', 598),
('paul', 598),
('light', 594),
('future', 593),
('entire', 592),
('finds', 591),
('gave', 589),
('laugh', 587),
('released', 586),
('expect', 584),
('fight', 581),
('particularly', 580),
('cinematography', 579),
('police', 579),
('whose', 578),
('type', 578),
('sound', 578),
('view', 573),
('enjoyable', 573),
('number', 572),
('romantic', 572),
('husband', 572),
('daughter', 572),
('documentary', 571),
('self', 570),
('superb', 569),
('modern', 569),
('took', 569),
('robert', 569),
('mean', 566),
('shown', 563),
('coming', 561),
('important', 560),
('king', 559),
('leave', 559),
('change', 558),
('somewhat', 555),
('wanted', 555),
('tells', 554),
('events', 552),
('run', 552),
('career', 552),
('country', 552),
('heard', 550),
('season', 550),
('greatest', 549),
('girls', 549),
('etc', 547),
('care', 546),
('starts', 545),
('english', 542),
('killer', 541),
('tale', 540),
('guys', 540),
('totally', 540),
('animation', 540),
('usual', 539),
('miss', 535),
('opinion', 535),
('easy', 531),
('violence', 531),
('songs', 530),
('british', 528),
('says', 526),
('realistic', 525),
('writing', 524),
('writer', 522),
('act', 522),
('comic', 521),
('thriller', 519),
('television', 517),
('power', 516),
('ones', 515),
('kid', 514),
('york', 513),
('novel', 513),
('alone', 512),
('problem', 512),
('attention', 509),
('involved', 508),
('kill', 507),
('extremely', 507),
('seemed', 506),
('hero', 505),
('french', 505),
('rock', 504),
('stuff', 501),
('wish', 499),
('begins', 498),
('taken', 497),
('sad', 497),
('ways', 496),
('richard', 495),
('knows', 494),
('atmosphere', 493),
('similar', 491),
('surprised', 491),
('taking', 491),
('car', 491),
('george', 490),
('perfectly', 490),
('across', 489),
('team', 489),
('eye', 489),
('sequence', 489),
('room', 488),
('due', 488),
('among', 488),
('serious', 488),
('powerful', 488),
('strange', 487),
('order', 487),
('cannot', 487),
('b', 487),
('beauty', 486),
('famous', 485),
('happened', 484),
('tries', 484),
('herself', 484),
('myself', 484),
('class', 483),
('four', 482),
('cool', 481),
('release', 479),
('anyway', 479),
('theme', 479),
('opening', 478),
('entertainment', 477),
('slow', 475),
('ends', 475),
('unique', 475),
('exactly', 475),
('easily', 474),
('level', 474),
('o', 474),
('red', 474),
('interest', 472),
('happen', 471),
('crime', 470),
('viewing', 468),
('sets', 467),
('memorable', 467),
('stop', 466),
('group', 466),
('problems', 463),
('dance', 463),
('working', 463),
('sister', 463),
('message', 463),
('knew', 462),
('mystery', 461),
('nature', 461),
('bring', 460),
('believable', 459),
('thinking', 459),
('brought', 459),
('mostly', 458),
('disney', 457),
('couldn', 457),
('society', 456),
('lady', 455),
('within', 455),
('blood', 454),
('parents', 453),
('upon', 453),
('viewers', 453),
('meets', 452),
('form', 452),
('peter', 452),
('tom', 452),
('usually', 452),
('soundtrack', 452),
('local', 450),
('certain', 448),
('follow', 448),
('whether', 447),
('possible', 446),
('emotional', 445),
('killed', 444),
('above', 444),
('de', 444),
('god', 443),
('middle', 443),
('needs', 442),
('happens', 442),
('flick', 442),
('masterpiece', 441),
('period', 440),
('major', 440),
('named', 439),
('haven', 439),
('particular', 438),
('th', 438),
('earth', 437),
('feature', 437),
('stand', 436),
('words', 435),
('typical', 435),
('elements', 433),
('obviously', 433),
('romance', 431),
('jane', 430),
('yourself', 427),
('showing', 427),
('brings', 426),
('fantasy', 426),
('guess', 423),
('america', 423),
('unfortunately', 422),
('huge', 422),
('indeed', 421),
('running', 421),
('talent', 420),
('stage', 419),
('started', 418),
('leads', 417),
('sweet', 417),
('japanese', 417),
('poor', 416),
('deal', 416),
('incredible', 413),
('personal', 413),
('fast', 412),
('became', 410),
('deep', 410),
('hours', 409),
('giving', 408),
('nearly', 408),
('dream', 408),
('clearly', 407),
('turned', 407),
('obvious', 406),
('near', 406),
('cut', 405),
('surprise', 405),
('era', 404),
('body', 404),
('hour', 403),
('female', 403),
('five', 403),
('note', 399),
('learn', 398),
('truth', 398),
('except', 397),
('feels', 397),
('match', 397),
('tony', 397),
('filmed', 394),
('clear', 394),
('complete', 394),
('street', 393),
('eventually', 393),
('keeps', 393),
('older', 393),
('lots', 393),
('buy', 392),
('william', 391),
('stewart', 391),
('fall', 390),
('joe', 390),
('meet', 390),
('unlike', 389),
('talking', 389),
('shots', 389),
('rating', 389),
('difficult', 389),
('dramatic', 388),
('means', 388),
('situation', 386),
('wonder', 386),
('present', 386),
('appears', 386),
('subject', 386),
('comments', 385),
('general', 383),
('sequences', 383),
('lee', 383),
('points', 382),
('earlier', 382),
('gone', 379),
('check', 379),
('suspense', 378),
('recommended', 378),
('ten', 378),
('third', 377),
('business', 377),
('talk', 375),
('leaves', 375),
('beyond', 375),
('portrayal', 374),
('beautifully', 373),
('single', 372),
('bill', 372),
('plenty', 371),
('word', 371),
('whom', 370),
('falls', 370),
('scary', 369),
('non', 369),
('figure', 369),
('battle', 369),
('using', 368),
('return', 368),
('doubt', 367),
('add', 367),
('hear', 366),
('solid', 366),
('success', 366),
('jokes', 365),
('oh', 365),
('touching', 365),
('political', 365),
('hell', 364),
('awesome', 364),
('boys', 364),
('sexual', 362),
('recently', 362),
('dog', 362),
('please', 361),
('wouldn', 361),
('straight', 361),
('features', 361),
('forget', 360),
('setting', 360),
('lack', 360),
('married', 359),
('mark', 359),
('social', 357),
('interested', 356),
('adventure', 356),
('actual', 355),
('terrific', 355),
('sees', 355),
('brothers', 355),
('move', 354),
('call', 354),
('various', 353),
('theater', 353),
('dr', 353),
('animated', 352),
('western', 351),
('baby', 350),
('space', 350),
('leading', 348),
('disappointed', 348),
('portrayed', 346),
('aren', 346),
('screenplay', 345),
('smith', 345),
('towards', 344),
('hate', 344),
('noir', 343),
('outstanding', 342),
('decent', 342),
('kelly', 342),
('directors', 341),
('journey', 341),
('none', 340),
('looked', 340),
('effective', 340),
('storyline', 339),
('caught', 339),
('sci', 339),
('fi', 339),
('cold', 339),
('mary', 339),
('rich', 338),
('charming', 338),
('popular', 337),
('rare', 337),
('manages', 337),
('harry', 337),
('spirit', 336),
('appreciate', 335),
('open', 335),
('moves', 334),
('basically', 334),
('acted', 334),
('inside', 333),
('boring', 333),
('century', 333),
('mention', 333),
('deserves', 333),
('subtle', 333),
('pace', 333),
('familiar', 332),
('background', 332),
('ben', 331),
('creepy', 330),
('supposed', 330),
('secret', 329),
('die', 328),
('jim', 328),
('question', 327),
('effect', 327),
('natural', 327),
('impressive', 326),
('rate', 326),
('language', 326),
('saying', 325),
('intelligent', 325),
('telling', 324),
('realize', 324),
('material', 324),
('scott', 324),
('singing', 323),
('dancing', 322),
('visual', 321),
('adult', 321),
('imagine', 321),
('kept', 320),
('office', 320),
('uses', 319),
('pure', 318),
('wait', 318),
('stunning', 318),
('review', 317),
('previous', 317),
('copy', 317),
('seriously', 317),
('reading', 316),
('create', 316),
('hot', 316),
('created', 316),
('magic', 316),
('somehow', 316),
('stay', 315),
('attempt', 315),
('escape', 315),
('crazy', 315),
('air', 315),
('frank', 315),
('hands', 314),
('filled', 313),
('expected', 312),
('average', 312),
('surprisingly', 312),
('complex', 311),
('quickly', 310),
('successful', 310),
('studio', 310),
('plus', 309),
('male', 309),
('co', 307),
('images', 306),
('casting', 306),
('following', 306),
('minute', 306),
('exciting', 306),
('members', 305),
('follows', 305),
('themes', 305),
('german', 305),
('reasons', 305),
('e', 305),
('touch', 304),
('edge', 304),
('free', 304),
('cute', 304),
('genius', 304),
('outside', 303),
('reviews', 302),
('admit', 302),
('ok', 302),
('younger', 302),
('fighting', 301),
('odd', 301),
('master', 301),
('recent', 300),
('thanks', 300),
('break', 300),
('comment', 300),
('apart', 299),
('emotions', 298),
('lovely', 298),
('begin', 298),
('doctor', 297),
('party', 297),
('italian', 297),
('la', 296),
('missed', 296),
...]
In [18]:
negative_counts.most_common()
Out[18]:
[('', 561462),
('.', 167538),
('the', 163389),
('a', 79321),
('and', 74385),
('of', 69009),
('to', 68974),
('br', 52637),
('is', 50083),
('it', 48327),
('i', 46880),
('in', 43753),
('this', 40920),
('that', 37615),
('s', 31546),
('was', 26291),
('movie', 24965),
('for', 21927),
('but', 21781),
('with', 20878),
('as', 20625),
('t', 20361),
('film', 19218),
('you', 17549),
('on', 17192),
('not', 16354),
('have', 15144),
('are', 14623),
('be', 14541),
('he', 13856),
('one', 13134),
('they', 13011),
('at', 12279),
('his', 12147),
('all', 12036),
('so', 11463),
('like', 11238),
('there', 10775),
('just', 10619),
('by', 10549),
('or', 10272),
('an', 10266),
('who', 9969),
('from', 9731),
('if', 9518),
('about', 9061),
('out', 8979),
('what', 8422),
('some', 8306),
('no', 8143),
('her', 7947),
('even', 7687),
('can', 7653),
('has', 7604),
('good', 7423),
('bad', 7401),
('would', 7036),
('up', 6970),
('only', 6781),
('more', 6730),
('when', 6726),
('she', 6444),
('really', 6262),
('time', 6209),
('had', 6142),
('my', 6015),
('were', 6001),
('which', 5780),
('very', 5764),
('me', 5606),
('see', 5452),
('don', 5336),
('we', 5328),
('their', 5278),
('do', 5236),
('story', 5208),
('than', 5183),
('been', 5100),
('much', 5078),
('get', 5037),
('because', 4966),
('people', 4806),
('then', 4761),
('make', 4722),
('how', 4688),
('could', 4686),
('any', 4658),
('into', 4567),
('made', 4541),
('first', 4306),
('other', 4305),
('well', 4254),
('too', 4174),
('them', 4165),
('plot', 4154),
('movies', 4080),
('acting', 4056),
('will', 3993),
('way', 3989),
('most', 3919),
('him', 3858),
('after', 3838),
('its', 3655),
('think', 3643),
('also', 3608),
('characters', 3600),
('off', 3567),
('watch', 3550),
('character', 3506),
('did', 3506),
('why', 3463),
('being', 3393),
('better', 3358),
('know', 3334),
('over', 3316),
('seen', 3265),
('ever', 3263),
('never', 3259),
('your', 3233),
('where', 3219),
('two', 3173),
('little', 3096),
('films', 3077),
('here', 3027),
('m', 3000),
('nothing', 2990),
('say', 2982),
('end', 2954),
('something', 2942),
('should', 2920),
('many', 2909),
('does', 2871),
('thing', 2866),
('show', 2862),
('ve', 2829),
('scene', 2816),
('scenes', 2785),
('these', 2724),
('go', 2717),
('didn', 2646),
('great', 2640),
('watching', 2640),
('re', 2620),
('doesn', 2601),
('through', 2560),
('such', 2544),
('man', 2516),
('worst', 2480),
('actually', 2449),
('actors', 2437),
('life', 2429),
('back', 2424),
('while', 2418),
('director', 2405),
('funny', 2336),
('going', 2319),
('still', 2283),
('another', 2254),
('look', 2247),
('now', 2237),
('old', 2215),
('those', 2212),
('real', 2170),
('few', 2158),
('love', 2152),
('horror', 2150),
('before', 2147),
('want', 2141),
('minutes', 2126),
('pretty', 2115),
('best', 2094),
('though', 2091),
('same', 2081),
('script', 2074),
('work', 2027),
('every', 2025),
('seems', 2023),
('least', 2011),
('enough', 1997),
('down', 1988),
('original', 1983),
('guy', 1964),
('got', 1952),
('around', 1943),
('part', 1942),
('lot', 1892),
('anything', 1874),
('find', 1860),
('new', 1854),
('again', 1849),
('isn', 1849),
('point', 1845),
('things', 1839),
('fact', 1839),
('give', 1823),
('makes', 1814),
('take', 1800),
('thought', 1798),
('d', 1770),
('whole', 1768),
('long', 1761),
('years', 1759),
('however', 1740),
('gets', 1714),
('making', 1695),
('cast', 1694),
('big', 1662),
('might', 1658),
('interesting', 1648),
('money', 1638),
('us', 1628),
('right', 1625),
('far', 1619),
('quite', 1596),
('without', 1595),
('come', 1595),
('almost', 1574),
('ll', 1567),
('action', 1566),
('awful', 1557),
('kind', 1539),
('reason', 1534),
('am', 1530),
('looks', 1528),
('must', 1522),
('done', 1510),
('comedy', 1504),
('someone', 1490),
('trying', 1486),
('wasn', 1484),
('poor', 1481),
('boring', 1478),
('instead', 1478),
('saw', 1475),
('away', 1469),
('girl', 1463),
('probably', 1444),
('believe', 1434),
('sure', 1433),
('looking', 1430),
('stupid', 1428),
('anyone', 1418),
('times', 1406),
('maybe', 1404),
('world', 1404),
('rather', 1394),
('terrible', 1391),
('may', 1390),
('last', 1390),
('since', 1388),
('let', 1385),
('tv', 1382),
('hard', 1374),
('between', 1374),
('waste', 1358),
('woman', 1356),
('feel', 1354),
('effects', 1348),
('half', 1341),
('own', 1333),
('young', 1317),
('music', 1316),
('idea', 1312),
('sense', 1306),
('bit', 1298),
('having', 1280),
('book', 1278),
('found', 1267),
('put', 1263),
('series', 1263),
('goes', 1256),
('worse', 1249),
('said', 1230),
('comes', 1224),
('role', 1222),
('main', 1220),
('else', 1199),
('everything', 1197),
('yet', 1196),
('low', 1189),
('screen', 1188),
('supposed', 1186),
('actor', 1185),
('either', 1183),
('budget', 1179),
('ending', 1179),
('audience', 1178),
('set', 1177),
('family', 1170),
('left', 1169),
('completely', 1168),
('both', 1158),
('wrong', 1155),
('always', 1151),
('course', 1148),
('place', 1148),
('seem', 1147),
('watched', 1142),
('day', 1132),
('simply', 1130),
('shot', 1126),
('mean', 1117),
('special', 1102),
('dead', 1101),
('three', 1094),
('house', 1085),
('oh', 1084),
('night', 1083),
('read', 1082),
('less', 1067),
('high', 1066),
('year', 1064),
('camera', 1061),
('worth', 1057),
('our', 1056),
('try', 1051),
('horrible', 1046),
('sex', 1046),
('video', 1043),
('black', 1039),
('although', 1036),
('couldn', 1036),
('once', 1033),
('rest', 1022),
('dvd', 1021),
('line', 1018),
('played', 1017),
('fun', 1007),
('during', 1006),
('production', 1003),
('everyone', 1002),
('play', 993),
('mind', 990),
('version', 989),
('kids', 989),
('seeing', 988),
('american', 980),
('given', 978),
('used', 969),
('performance', 968),
('especially', 963),
('together', 963),
('tell', 959),
('women', 958),
('start', 956),
('need', 955),
('second', 953),
('takes', 950),
('each', 950),
('wife', 944),
('dialogue', 942),
('use', 940),
('problem', 938),
('star', 934),
('unfortunately', 931),
('himself', 929),
('doing', 926),
('death', 922),
('name', 921),
('lines', 919),
('killer', 914),
('getting', 913),
('help', 905),
('couple', 902),
('fan', 902),
('head', 898),
('crap', 895),
('guess', 888),
('piece', 884),
('nice', 880),
('different', 878),
('school', 876),
('later', 875),
('entire', 869),
('shows', 860),
('next', 858),
('john', 858),
('short', 857),
('seemed', 857),
('hollywood', 850),
('home', 848),
('true', 846),
('person', 846),
('absolutely', 842),
('sort', 840),
('care', 839),
('understand', 836),
('plays', 835),
('felt', 834),
('written', 829),
('title', 828),
('men', 822),
('until', 821),
('flick', 816),
('decent', 815),
('face', 814),
('friends', 810),
('stars', 807),
('job', 807),
('case', 807),
('itself', 804),
('yes', 801),
('perhaps', 800),
('went', 797),
('wanted', 797),
('called', 796),
('annoying', 795),
('ridiculous', 790),
('tries', 790),
('laugh', 788),
('evil', 787),
('along', 786),
('top', 785),
('hour', 784),
('full', 783),
('came', 780),
('writing', 780),
('keep', 770),
('totally', 767),
('playing', 766),
('god', 765),
('won', 764),
('guys', 763),
('already', 762),
('gore', 757),
('direction', 748),
('save', 746),
('lost', 745),
('example', 744),
('sound', 742),
('war', 741),
('attempt', 735),
('car', 733),
('except', 733),
('moments', 732),
('blood', 732),
('obviously', 730),
('act', 729),
('remember', 728),
('kill', 727),
('truly', 726),
('white', 726),
('father', 726),
('b', 725),
('thinking', 720),
('ok', 716),
('finally', 716),
('turn', 711),
('quality', 701),
('lack', 698),
('style', 694),
('wouldn', 693),
('cheap', 691),
('none', 690),
('kid', 686),
('please', 686),
('boy', 685),
('seriously', 684),
('lead', 680),
('dull', 677),
('children', 676),
('starts', 675),
('stuff', 673),
('hope', 672),
('looked', 670),
('recommend', 669),
('under', 668),
('run', 667),
('killed', 667),
('enjoy', 666),
('others', 666),
('etc', 663),
('myself', 663),
('beginning', 662),
('girls', 662),
('against', 662),
('obvious', 660),
('small', 660),
('hell', 659),
('slow', 657),
('hand', 656),
('wonder', 652),
('lame', 652),
('becomes', 651),
('picture', 651),
('based', 650),
('early', 648),
('behind', 646),
('poorly', 644),
('avoid', 642),
('apparently', 640),
('complete', 640),
('happens', 639),
('anyway', 638),
('classic', 637),
('several', 636),
('despite', 635),
('certainly', 635),
('episode', 635),
('often', 631),
('cut', 630),
('writer', 630),
('mother', 628),
('predictable', 628),
('gave', 628),
('become', 627),
('close', 625),
('fans', 624),
('saying', 621),
('scary', 619),
('stop', 618),
('live', 618),
('wants', 617),
('self', 615),
('mr', 612),
('jokes', 611),
('friend', 611),
('cannot', 610),
('overall', 609),
('cinema', 604),
('child', 603),
('silly', 601),
('beautiful', 596),
('human', 595),
('expect', 594),
('liked', 593),
('happened', 592),
('bunch', 590),
('entertaining', 590),
('actress', 588),
('final', 588),
('says', 584),
('performances', 584),
('turns', 577),
('humor', 577),
('themselves', 576),
('eyes', 576),
('hours', 574),
('happen', 573),
('basically', 572),
('days', 572),
('running', 571),
('involved', 569),
('disappointed', 569),
('call', 569),
('directed', 568),
('group', 568),
('fight', 567),
('daughter', 566),
('talking', 566),
('body', 566),
('badly', 565),
('sorry', 565),
('throughout', 563),
('viewer', 563),
('yourself', 562),
('extremely', 562),
('interest', 561),
('heard', 561),
('violence', 561),
('shots', 559),
('side', 557),
('word', 556),
('art', 555),
('possible', 554),
('dark', 551),
('game', 551),
('hero', 550),
('alone', 549),
('son', 547),
('type', 547),
('leave', 547),
('gives', 546),
('parts', 546),
('single', 546),
('started', 545),
('female', 543),
('rating', 541),
('mess', 541),
('voice', 541),
('aren', 540),
('town', 540),
('drama', 538),
('definitely', 537),
('unless', 536),
('review', 534),
('effort', 533),
('weak', 533),
('able', 533),
('took', 531),
('non', 530),
('five', 530),
('matter', 529),
('usually', 529),
('michael', 528),
('feeling', 526),
('huge', 523),
('sequel', 522),
('soon', 521),
('exactly', 520),
('past', 519),
('turned', 518),
('police', 518),
('tried', 515),
('middle', 513),
('talent', 513),
('genre', 512),
('zombie', 510),
('ends', 509),
('history', 509),
('straight', 503),
('opening', 501),
('serious', 501),
('coming', 501),
('moment', 500),
('lives', 499),
('sad', 499),
('dialog', 498),
('particularly', 498),
('editing', 493),
('clearly', 492),
('beyond', 491),
('earth', 491),
('taken', 490),
('cool', 490),
('level', 489),
('dumb', 489),
('okay', 488),
('major', 487),
('fast', 485),
('premise', 485),
('joke', 484),
('stories', 484),
('wasted', 483),
('minute', 483),
('across', 482),
('mostly', 482),
('rent', 482),
('late', 481),
('falls', 481),
('fails', 481),
('mention', 478),
('theater', 475),
('stay', 472),
('sometimes', 472),
('hit', 468),
('talk', 467),
('fine', 467),
('die', 466),
('storyline', 465),
('pointless', 465),
('taking', 464),
('order', 462),
('brother', 461),
('whatever', 460),
('told', 460),
('wish', 458),
('room', 456),
('career', 455),
('appears', 455),
('write', 455),
('known', 454),
('husband', 454),
('living', 451),
('sit', 450),
('ten', 450),
('words', 449),
('monster', 448),
('chance', 448),
('hate', 444),
('novel', 444),
('add', 443),
('english', 443),
('somehow', 441),
('strange', 440),
('imdb', 438),
('actual', 438),
('total', 437),
('material', 437),
('killing', 437),
('ones', 437),
('knew', 436),
('king', 434),
('number', 434),
('using', 433),
('lee', 431),
('power', 431),
('shown', 431),
('works', 431),
('giving', 431),
('points', 430),
('possibly', 430),
('kept', 430),
('four', 429),
('local', 427),
('usual', 426),
('including', 425),
('problems', 424),
('ago', 424),
('opinion', 424),
('nudity', 423),
('age', 422),
('due', 421),
('roles', 420),
('writers', 419),
('decided', 419),
('near', 418),
('flat', 418),
('easily', 418),
('murder', 417),
('experience', 417),
('reviews', 416),
('imagine', 415),
('feels', 413),
('plain', 411),
('somewhat', 411),
('class', 410),
('score', 410),
('song', 409),
('bring', 409),
('whether', 409),
('otherwise', 408),
('whose', 408),
('average', 408),
('pathetic', 407),
('nearly', 407),
('knows', 407),
('zombies', 407),
('cinematography', 406),
('cheesy', 406),
('upon', 406),
('city', 405),
('space', 405),
('credits', 404),
('james', 403),
('lots', 403),
('change', 403),
('entertainment', 402),
('nor', 402),
('wait', 401),
('released', 400),
('needs', 399),
('shame', 398),
('attention', 396),
('comments', 394),
('bored', 393),
('free', 393),
('lady', 393),
('expected', 392),
('needed', 392),
('clear', 392),
('view', 391),
('development', 390),
('check', 390),
('doubt', 390),
('figure', 389),
('mystery', 389),
('excellent', 388),
('garbage', 388),
('sequence', 386),
('television', 386),
('o', 385),
('sets', 385),
('laughable', 384),
('potential', 384),
('robert', 382),
('light', 382),
('country', 382),
('documentary', 382),
('reality', 382),
('general', 381),
('ask', 381),
('comic', 380),
('fall', 380),
('begin', 380),
('footage', 379),
('stand', 379),
('forced', 379),
('trash', 379),
('remake', 379),
('thriller', 378),
('songs', 378),
('gay', 377),
('within', 377),
('hardly', 376),
('above', 375),
('gone', 375),
('george', 374),
('means', 373),
('sounds', 373),
('directing', 372),
('move', 372),
('david', 372),
('buy', 372),
('rock', 371),
('forward', 371),
('important', 371),
('hot', 370),
('haven', 370),
('filmed', 370),
('british', 370),
('heart', 369),
('reading', 369),
('fake', 369),
('incredibly', 368),
('weird', 368),
('hear', 368),
('enjoyed', 367),
('hilarious', 367),
('cop', 367),
('musical', 367),
('message', 366),
('happy', 366),
('pay', 366),
('laughs', 365),
('box', 365),
('suspense', 363),
('sadly', 363),
('eye', 362),
('third', 361),
('similar', 361),
('named', 361),
('modern', 360),
('failed', 359),
('events', 359),
('forget', 358),
('question', 358),
('male', 357),
('finds', 357),
('perfect', 356),
('spent', 355),
('sister', 355),
('feature', 354),
('result', 354),
('comment', 353),
('girlfriend', 353),
('sexual', 352),
('attempts', 351),
('neither', 351),
('richard', 351),
('screenplay', 350),
('elements', 350),
('spoilers', 349),
('brain', 348),
('filmmakers', 348),
('showing', 348),
('miss', 347),
('dr', 347),
('christmas', 347),
('cover', 345),
('red', 344),
('sequences', 344),
('typical', 343),
('excuse', 343),
('crazy', 342),
('ideas', 342),
('baby', 342),
('loved', 341),
('meant', 341),
('worked', 340),
('fire', 340),
('unbelievable', 339),
('follow', 339),
('theme', 337),
('barely', 336),
('producers', 336),
('twist', 336),
('plus', 336),
('appear', 336),
('directors', 335),
('team', 335),
('viewers', 333),
('leads', 332),
('tom', 332),
('slasher', 332),
('wrote', 331),
('villain', 331),
('gun', 331),
('working', 331),
('island', 330),
('strong', 330),
('open', 330),
('realize', 330),
('positive', 329),
('disappointing', 329),
('yeah', 329),
('quickly', 329),
('weren', 328),
('release', 328),
('simple', 328),
('honestly', 328),
('eventually', 327),
('period', 327),
('tells', 327),
('kills', 327),
('doctor', 327),
('nowhere', 326),
('list', 326),
('acted', 326),
('herself', 326),
('dog', 326),
('walk', 325),
('air', 324),
('apart', 324),
('makers', 323),
('subject', 323),
('learn', 322),
('fi', 322),
('sci', 319),
('bother', 319),
('admit', 319),
('jack', 318),
('disappointment', 318),
('hands', 318),
('note', 318),
('certain', 317),
('e', 317),
('value', 317),
('casting', 317),
('grade', 316),
('peter', 316),
('suddenly', 315),
('missing', 315),
('form', 313),
('stick', 313),
('previous', 313),
('break', 313),
('soundtrack', 312),
('surprised', 311),
('front', 311),
('expecting', 311),
('parents', 310),
('surprise', 310),
('relationship', 310),
('shoot', 309),
('today', 309),
('painful', 308),
('ways', 308),
('leaves', 308),
('ended', 308),
('creepy', 308),
('concept', 308),
('somewhere', 308),
('vampire', 308),
('spend', 307),
('th', 307),
('future', 306),
('difficult', 306),
('effect', 306),
('fighting', 306),
('street', 306),
('c', 305),
('america', 305),
('accent', 304),
('truth', 302),
('project', 302),
('joe', 301),
('f', 301),
('deal', 301),
('indeed', 301),
('biggest', 300),
('rate', 300),
('paul', 299),
('japanese', 299),
('utterly', 298),
('begins', 298),
('redeeming', 298),
('college', 298),
('york', 297),
('fairly', 297),
('disney', 297),
('crew', 296),
('create', 296),
('cartoon', 296),
('revenge', 296),
('co', 295),
('outside', 295),
('computer', 295),
('interested', 295),
('stage', 295),
('considering', 294),
('speak', 294),
('among', 294),
('towards', 293),
('channel', 293),
('sick', 293),
('talented', 292),
('cause', 292),
('particular', 292),
('van', 292),
('hair', 292),
('bottom', 291),
('reasons', 291),
('mediocre', 290),
('cat', 290),
('telling', 290),
('supporting', 289),
('store', 289),
('hoping', 288),
('waiting', 288),
...]
In [19]:
pos_neg_ratios = Counter()
for term,cnt in list(total_counts.most_common()):
if(cnt > 100):
pos_neg_ratio = positive_counts[term] / float(negative_counts[term]+1)
pos_neg_ratios[term] = pos_neg_ratio
for word,ratio in pos_neg_ratios.most_common():
if(ratio > 1):
pos_neg_ratios[word] = np.log(ratio)
else:
pos_neg_ratios[word] = -np.log((1 / (ratio+0.01)))
In [22]:
pos_neg_ratios.most_common()
Out[22]:
[('superb', 1.7091514458966952),
('wonderful', 1.5645425925262093),
('fantastic', 1.5048433868558566),
('excellent', 1.4647538505723599),
('amazing', 1.3919815802404802),
('powerful', 1.2999662776313934),
('favorite', 1.2668956297860055),
('perfect', 1.246742480713785),
('brilliant', 1.2287554137664785),
('recommended', 1.2163953243244932),
('perfectly', 1.1971931173405572),
('subtle', 1.1734135017508081),
('rare', 1.1566438362402944),
('loved', 1.1563661500586044),
('highly', 1.1420208631618658),
('tony', 1.1397491942285991),
('today', 1.1050431789984001),
('awesome', 1.0931328229034842),
('unique', 1.0881409888008142),
('beauty', 1.050410186850232),
('fascinating', 1.0414538748281612),
('greatest', 1.0248947127715422),
('portrayal', 1.0189810189761024),
('incredible', 1.0061677561461084),
('harry', 0.99176919305006062),
('sweet', 0.98966110487955483),
('oscar', 0.98721905111049713),
('complex', 0.97761897738147796),
('solid', 0.97537964824416146),
('beautiful', 0.97326301262841053),
('feelings', 0.95551144502743635),
('paris', 0.95278479030472663),
('heart', 0.95238806924516806),
('masterpiece', 0.94155039863339296),
('themes', 0.94118828349588235),
('charming', 0.92520609553210742),
('impact', 0.91815814604895041),
('funniest', 0.90078654533818991),
('season', 0.89827222637147675),
('compelling', 0.89462923509297576),
('great', 0.88810470901464589),
('tragedy', 0.88563699078315261),
('arthur', 0.87546873735389985),
('gorgeous', 0.8731725250935497),
('enjoyed', 0.87070195951624607),
('natural', 0.86997924506912838),
('moving', 0.85566611005772031),
('lovely', 0.85290640004681306),
('memorable', 0.84801189112086062),
('episodes', 0.84223712084137292),
('strong', 0.84167135777060931),
('smith', 0.83959811108590054),
('apartment', 0.83333115290549531),
('adventure', 0.83150561393278388),
('adds', 0.82485652591452319),
('childhood', 0.82208086393583857),
('realistic', 0.80807714723392232),
('cry', 0.80011930011211307),
('impressed', 0.79258107754813223),
('edge', 0.789774016249017),
('jean', 0.78845736036427028),
('frank', 0.78275933924963248),
('tale', 0.77010822169607374),
('fresh', 0.76158434211317383),
('animated', 0.75768570169751648),
('enjoyable', 0.75246375771636476),
('performances', 0.74883252516063137),
('simple', 0.74641420974143258),
('relationship', 0.74484232345601786),
('supporting', 0.74357803418683721),
('emotional', 0.73678211645681524),
('brings', 0.73142936713096229),
('henry', 0.72642196944481741),
('society', 0.72433010799663333),
('available', 0.72415741730250549),
('best', 0.72347034060446314),
('magic', 0.71878961117328299),
('delivers', 0.71846498854423513),
('jim', 0.71783979315031676),
('relationships', 0.71393795022901896),
('charlie', 0.71024161391924534),
('atmosphere', 0.70744773070214162),
('genius', 0.706392407309966),
('surprisingly', 0.6995780708902356),
('sky', 0.69780919366575667),
('romantic', 0.69664981111114743),
('match', 0.69566924999265523),
('meets', 0.69314718055994529),
('love', 0.69198533541937324),
('paul', 0.68980827929443067),
('andy', 0.68846333124751902),
('performance', 0.68797386327972465),
('unlike', 0.68546468438792907),
('award', 0.6824518914431974),
('ride', 0.68229716453587952),
('dreams', 0.67599410133369586),
('effective', 0.67565402311242806),
('works', 0.67445504754779284),
('master', 0.67015766233524654),
('easy', 0.66895995494594152),
('city', 0.66820823221269321),
('england', 0.66387679825983203),
('sees', 0.66263163663399482),
('both', 0.66248336767382998),
('definitely', 0.66199789483898808),
('appreciate', 0.66083893732728749),
('future', 0.65834665141052828),
('douglas', 0.65540685257709819),
('inspired', 0.65459851044271034),
('marriage', 0.65392646740666405),
('father', 0.65172321672194655),
('page', 0.65123628494430852),
('era', 0.6495567444850836),
('joan', 0.64891392558311978),
('fantasy', 0.64726757480925168),
('personal', 0.64355023942057321),
('william', 0.64083139119578469),
('jack', 0.63838309514997038),
('jane', 0.63443957973316734),
('gives', 0.63383568159497883),
('animation', 0.63208692379869902),
('classic', 0.62504956428050518),
('impressive', 0.62211140744319349),
('artist', 0.62168821657780038),
('moved', 0.6197197120051281),
('innocent', 0.61851219917136446),
('eddie', 0.61691981517206107),
('nature', 0.61594514653194088),
('brian', 0.61344043794920278),
('offers', 0.61207935747116349),
('pleasure', 0.61195702582993206),
('images', 0.61159731359583758),
('games', 0.61067095873570676),
('academy', 0.60872983874736208),
('fine', 0.60496962268013299),
('job', 0.59845562125168661),
('river', 0.59637962862495086),
('believable', 0.59566072133302495),
('always', 0.59470710774669278),
('growing', 0.58466653756587539),
('touch', 0.58122926435596001),
('lives', 0.5810976767513224),
('pre', 0.57700753064729182),
('young', 0.57531672344534313),
('french', 0.5720692490067093),
('war', 0.56843317302781682),
('players', 0.56509525370004821),
('knowing', 0.56489284503626647),
('true', 0.56281525180810066),
('jr', 0.56220982311246936),
('sent', 0.55961578793542266),
('grand', 0.55961578793542266),
('brothers', 0.55891181043362848),
('david', 0.55693122256475369),
('dick', 0.55431073570572953),
('charm', 0.55288175575407861),
('twists', 0.55244729845681018),
('jeff', 0.55179306225421365),
('family', 0.55116244510065526),
('thanks', 0.55049088015842218),
('world', 0.54744234723432639),
('life', 0.54695514434959924),
('color', 0.54405127139431109),
('superior', 0.54333490233128523),
('york', 0.54318235866536513),
('jackson', 0.54232429082536171),
('enjoy', 0.54124285135906114),
('stands', 0.5389965007326869),
('each', 0.5388212312554177),
('different', 0.53709860682460819),
('share', 0.53408248593025787),
('series', 0.5325809226575603),
('fellow', 0.5323318289869543),
('loves', 0.53062825106217038),
('century', 0.53002783074992665),
('musical', 0.52966871156747064),
('approach', 0.52806743020049673),
('moves', 0.5279372642387119),
('tells', 0.52415107836314001),
('radio', 0.52394671172868779),
('uncle', 0.52354439617376536),
('deep', 0.52309571635780505),
('reminds', 0.52157841554225237),
('famous', 0.52118841080153722),
('epic', 0.51919387343650736),
('adult', 0.519167695083386),
('shows', 0.51915322220375304),
('youth', 0.5185626062681431),
('human', 0.51851411224987087),
('tarzan', 0.51813827061227724),
('passion', 0.5162164724008671),
('desire', 0.51607497965213445),
('dirty', 0.51557622652458857),
('fox', 0.51557622652458857),
('fun', 0.51439068993048687),
('south', 0.51420972175023116),
('present', 0.51341965894303732),
('smile', 0.51265880484765169),
('alan', 0.51082562376599072),
('ring', 0.51082562376599072),
('begins', 0.51015650363396647),
('success', 0.50900578704900468),
('japan', 0.50900578704900468),
('accurate', 0.50895471583017893),
('recently', 0.50714914903668207),
('fu', 0.50704490092608467),
('finding', 0.50637127341661037),
('among', 0.50334004951332734),
('viewing', 0.50302139827440906),
('finds', 0.50128303100539795),
('plays', 0.49975983848890226),
('age', 0.49941323171424595),
('roles', 0.49839716550752178),
('james', 0.49837216269470402),
('brought', 0.49783842823917956),
('hilarious', 0.49714551986191058),
('brutal', 0.49681488669639234),
('dance', 0.49581998314812048),
('thoroughly', 0.49414593456733524),
('fully', 0.49213349075383811),
('romance', 0.4901589869574316),
('happy', 0.4898997500608791),
('crime', 0.48977221456815834),
('singing', 0.4893852925281213),
('especially', 0.48901267837860624),
('shakespeare', 0.48754793889664511),
('detail', 0.48609484250827351),
('necessary', 0.48302334245403883),
('humanity', 0.48265474679929443),
('drama', 0.48221998493060503),
('pictures', 0.47929937011921681),
('history', 0.47732966933780852),
('ordinary', 0.47725880012690741),
('episode', 0.47529620261150429),
('role', 0.47520268270188676),
('spirit', 0.47477690799839323),
('ways', 0.47323464982718205),
('familiar', 0.47241617565111949),
('dated', 0.47121648567094482),
('dream', 0.46608972992459924),
('critics', 0.46430560813109778),
('born', 0.46411383518967209),
('detective', 0.4636633473511525),
('higher', 0.46328467899699055),
('remains', 0.46262352194811296),
('information', 0.46034171833399862),
('deserved', 0.45999798712841444),
('lynch', 0.45953232937844013),
('struggle', 0.45911782160048453),
('language', 0.45902121257712653),
('visual', 0.45823514408822852),
('social', 0.45720078250735313),
('reality', 0.45719346885019546),
('hidden', 0.45675840249571492),
('sometimes', 0.45563021171182794),
('modern', 0.45500247579345005),
('popular', 0.45410691533051023),
('surprised', 0.4534409399850382),
('follows', 0.45245361754408348),
('keeps', 0.45234869400701483),
('john', 0.4520909494482197),
('mixed', 0.45198512374305722),
('justice', 0.45142724367280018),
('years', 0.44919197032104968),
('lose', 0.44658335503763702),
('caught', 0.44610275383999071),
('chinese', 0.44507424620321018),
('puts', 0.44279106572085081),
('criminal', 0.4412745608048752),
('minor', 0.4409224199448939),
('liked', 0.44074991514020723),
('de', 0.43983275161237217),
('flaws', 0.43983275161237217),
('light', 0.43884433018199892),
('slowly', 0.43785660389939979),
('comedic', 0.43721380642274466),
('married', 0.43658501682196887),
('murder', 0.4353180712578455),
('physical', 0.4353180712578455),
('johnny', 0.43483971678806865),
('comedies', 0.43395706390247063),
('silent', 0.43395706390247063),
('played', 0.43387244114515305),
('international', 0.43363598507486073),
('vision', 0.43286408229627887),
('intelligent', 0.43196704885367099),
('shop', 0.43078291609245434),
('also', 0.43036720209769169),
('miss', 0.43006426712153217),
('experience', 0.4291068711652048),
('often', 0.42840667735057109),
('disney', 0.42758990438880029),
('events', 0.42744401482693967),
('dancing', 0.42744401482693967),
('forces', 0.42381424677636087),
('boss', 0.42348361361084275),
('key', 0.42306086999854398),
('michael', 0.41985384556026406),
('pace', 0.41907076016394307),
('regular', 0.41796527087239582),
('equally', 0.41702593050924036),
('late', 0.41510042009183645),
('business', 0.41479229098330311),
('later', 0.41304664556056209),
('own', 0.41293340271569018),
('wise', 0.41243377742425769),
('likable', 0.41197978912935806),
('support', 0.41181433578682319),
('live', 0.41137116554424646),
('score', 0.4111262635429026),
('glad', 0.4091619699894905),
('important', 0.40904292945604837),
('well', 0.40898417332336018),
('met', 0.40835946613452889),
('dark', 0.40787765451354829),
('queen', 0.40786031883411922),
('wild', 0.40739003651774874),
('catch', 0.40732557376108414),
('battle', 0.40546510810816438),
('discover', 0.40234498077192071),
('younger', 0.40215931997366489),
('particular', 0.40204630135937885),
('small', 0.39990251475094907),
('recommend', 0.39947706626354179),
('documentary', 0.39935422047536423),
('studio', 0.39903421777787396),
('lucky', 0.39903421777787396),
('political', 0.3986391430377646),
('older', 0.39786050872294498),
('rarely', 0.3975599286010511),
('robert', 0.39584544494568491),
('decade', 0.3957089331627997),
('festival', 0.39505918694704767),
('ice', 0.39370026652857798),
('mary', 0.39227810382621064),
('herself', 0.39212473581937829),
('animals', 0.39122275139262125),
('son', 0.38952363011047619),
('details', 0.3892704221881837),
('early', 0.38837067474886433),
('photography', 0.38711596943996779),
('including', 0.3865071943635503),
('cinema', 0.38486877748306314),
('heaven', 0.38473897759104753),
('culture', 0.38367044828074415),
('jerry', 0.38358239685865669),
('humorous', 0.38341697088640192),
('between', 0.3826616190905876),
('proves', 0.3823693933135151),
('mine', 0.38136755652910381),
('still', 0.38004251519484533),
('view', 0.37962387692427707),
('released', 0.37935836227044345),
('song', 0.37910443841709585),
('step', 0.37879686102600307),
('media', 0.3782154656607889),
('ford', 0.3782154656607889),
('become', 0.37747619820593153),
('exciting', 0.37647757123491205),
('collection', 0.37647757123491205),
('system', 0.37634770953743713),
('creating', 0.37630852381670882),
('parents', 0.37609921330379997),
('release', 0.37564284664554326),
('team', 0.37525132951166773),
('cold', 0.37496165381474977),
('adam', 0.37267528528517352),
('taylor', 0.37129621249291445),
('opposite', 0.37124715079782306),
('successful', 0.3705962824573405),
('genre', 0.37042019954698968),
('lord', 0.36987416300546178),
('although', 0.36979962340393463),
('accept', 0.3695991949196677),
('price', 0.36899542832677301),
('soundtrack', 0.36747898929207806),
('entertaining', 0.36694353008558095),
('u', 0.36617739103512864),
('seven', 0.36615368789327618),
('humour', 0.36602337605686763),
('country', 0.36551305709644938),
('very', 0.36505258934423584),
('becoming', 0.36503153852967385),
('form', 0.36428919392397774),
('mrs', 0.36412376795172297),
('german', 0.36372550193539915),
('return', 0.36290549368936847),
('uses', 0.36251372091256512),
('may', 0.36162314149237229),
('win', 0.36161322557931491),
('stories', 0.36120076939693646),
('wind', 0.36020851652004349),
('ago', 0.35972909878548015),
('includes', 0.35894509247327155),
('past', 0.35821222325761887),
('chemistry', 0.35815899071122737),
('starring', 0.35767444427181588),
('boys', 0.35589032247831454),
('peter', 0.35478040595495058),
('bill', 0.3543734469504532),
('day', 0.35384658780188777),
('became', 0.35319667956240758),
('normal', 0.35296436454219671),
('british', 0.35289422109882629),
('mother', 0.35269246157441486),
('cinematography', 0.3524892921304002),
('th', 0.35211912740287166),
('travel', 0.35176092913630336),
('fate', 0.35040533092513704),
('summer', 0.35020242943311497),
('actions', 0.34967374847974886),
('his', 0.34931337328072987),
('theme', 0.34865470192789649),
('personally', 0.347988678682772),
('plenty', 0.34785755884633829),
('stage', 0.34751146559807738),
('recent', 0.34719619998418855),
('throughout', 0.34705434595174994),
('building', 0.34687094384211148),
('whose', 0.34585871262957568),
('escape', 0.34540086736551434),
('element', 0.34432910811643308),
('western', 0.34333332700115821),
('certain', 0.34274184963480825),
('helped', 0.3417492937220567),
('richard', 0.34092658697059319),
('bruce', 0.33821288109899689),
('halloween', 0.33787181697563612),
('harris', 0.33780645963434952),
('various', 0.33703896942187339),
('professional', 0.33647223662121289),
('truly', 0.33568591851504065),
('songs', 0.33534080146374112),
('meet', 0.33493495730232653),
('fiction', 0.33431628896614474),
('allen', 0.33393309010253663),
('working', 0.33259208516974553),
('indeed', 0.33220581630751173),
('brother', 0.33213383502261484),
('others', 0.33131869289821475),
('created', 0.33079219610550942),
('study', 0.32964627155081305),
('length', 0.32935476885234893),
('japanese', 0.3293037471426003),
('engaging', 0.32880936387564386),
('dramatic', 0.3262157364540238),
('n', 0.325877670189817),
('addition', 0.32466107569286828),
('chance', 0.32430819419122453),
('america', 0.3237870770938972),
('central', 0.32317195743373167),
('led', 0.32302143889708984),
('change', 0.32294408442016082),
('memory', 0.32197114593041304),
('lived', 0.3212730168878164),
('audiences', 0.3207125981282628),
('deal', 0.3202582428863936),
('tough', 0.3193534162894322),
('our', 0.31888367222322705),
('frame', 0.31845373111853459),
('man', 0.31820087291645405),
('red', 0.31766290466371677),
('known', 0.3174542307854511),
('terms', 0.3159667862903629),
('personality', 0.31527002897060996),
('wide', 0.31462670769523299),
('thriller', 0.31436767808346239),
('contains', 0.31321537445694564),
('comic', 0.31295066662666593),
('originally', 0.31177962403084153),
('sing', 0.31131367698505791),
('manages', 0.31069459372438463),
('town', 0.31052454575399091),
('mr', 0.31026367714849018),
('ray', 0.30932124755526214),
('vs', 0.30900484192060212),
('able', 0.30715417586460125),
('issues', 0.30715417586460125),
('situation', 0.3068631313036248),
('friend', 0.30589751234312218),
('overall', 0.30555419721790278),
('tom', 0.30553968985178742),
('creative', 0.30538164955118191),
('states', 0.30501352880342086),
('kong', 0.30492387888628691),
('similar', 0.30479991596874945),
('viewers', 0.30475113250633357),
('adults', 0.30472353830663262),
('rock', 0.30368241379822197),
('dangerous', 0.30368241379822197),
('earlier', 0.30351353766846162),
('search', 0.30330739035486182),
('living', 0.30305010268009497),
('bit', 0.3020130715164977),
('top', 0.30189358591274129),
('rich', 0.30158497762077241),
('portrayed', 0.30126133057816179),
('village', 0.29908470454959252),
('places', 0.29904583110209598),
('grant', 0.29878096551982986),
('eye', 0.29795965521002121),
('somewhat', 0.29794476439690809),
('wars', 0.29644637044536915),
('soon', 0.29626581614317243),
('quite', 0.29407984725856462),
('period', 0.29376111852816317),
('whom', 0.29310213992112011),
('trip', 0.29004893746204713),
('featuring', 0.28978070910870207),
('remember', 0.28973757163387698),
('record', 0.2897291560735058),
('television', 0.28961818147864732),
('narrative', 0.28905475626375304),
('ladies', 0.28571162846448228),
('community', 0.28367405105424215),
('together', 0.28352406230311722),
('us', 0.28306738959710842),
('game', 0.28223246768421617),
('new', 0.28146331508444461),
('himself', 0.27958486221916151),
('conflict', 0.27917138278387232),
('men', 0.27822068644413961),
('art', 0.27774073436393321),
('hit', 0.27750250423831679),
('music', 0.27737860408205628),
('her', 0.27597567116772548),
('follow', 0.2758476148047781),
('decision', 0.27559002668675225),
('drawn', 0.27551153683152568),
('makes', 0.27520789822660519),
('private', 0.27477866761587305),
('lesson', 0.27407642039600238),
('number', 0.27379296029111383),
('president', 0.2736958304770411),
('judge', 0.27311365662349046),
('months', 0.2727404924535819),
('truth', 0.27271919977506848),
('filled', 0.26973963860864258),
('setting', 0.26933293378358436),
('tone', 0.26907688462074658),
('told', 0.26851915373053697),
('george', 0.26747936513426146),
('will', 0.26732095172091325),
('takes', 0.26696419994803383),
('intriguing', 0.26646623330150837),
('style', 0.26623060455034442),
('portray', 0.26570316573300568),
('capture', 0.26543646350446126),
('current', 0.26510775041324192),
('shown', 0.26485403989558004),
('stone', 0.26484258048195813),
('surprise', 0.26409415492730509),
('times', 0.26394848811968658),
('certainly', 0.26381459104513749),
('story', 0.26358920318924967),
('sister', 0.2627963232342036),
('him', 0.26262336538874875),
('williams', 0.26236426446749106),
('yet', 0.26229999918036373),
('pure', 0.26079317223544984),
('pilot', 0.26060111384911017),
('anne', 0.26028309826366652),
('locations', 0.25901417758220913),
('dvd', 0.25889596573330381),
('many', 0.25786034905155958),
('candy', 0.25746829385528441),
('billy', 0.25671984684781407),
('joe', 0.2557197217488224),
('italian', 0.25565282987950433),
('survive', 0.25423413838424086),
('spot', 0.25398822666530829),
('side', 0.25350239533973568),
('soul', 0.25312438352614564),
('hotel', 0.25259075264051079),
('deserves', 0.25131442828090617),
('office', 0.25086809932906462),
('king', 0.25080344206641508),
('nice', 0.25068363282694878),
('latter', 0.24921579162398483),
('pair', 0.24921579162398483),
('law', 0.24882003935973376),
('attention', 0.24851173586333228),
('former', 0.24783616390458127),
('features', 0.24697088739476736),
('street', 0.24696186428206401),
('picture', 0.24498908024018978),
('boy', 0.24334625863172918),
('as', 0.24332065954522725),
('fans', 0.24310302905381345),
('occasionally', 0.24242548086366003),
('indian', 0.24195982311368577),
('moments', 0.24125949896069235),
('american', 0.24072508936400988),
('forever', 0.24033595185458262),
('full', 0.2393382372341904),
('began', 0.23862591546258669),
('luck', 0.23684239567237159),
('difficult', 0.23673159603124905),
('humor', 0.23620664528893415),
('independent', 0.23601006190745993),
('prison', 0.23590580292806096),
('wife', 0.23555300701683429),
('leading', 0.23543138210790299),
('typical', 0.23470437371528577),
('its', 0.23445785826028542),
('sequence', 0.23393779644509025),
('usual', 0.23293155768037266),
('knowledge', 0.2326222952687535),
('message', 0.23236520603166388),
('thank', 0.23233197736861597),
('opens', 0.23180161405732438),
('hasn', 0.23159260580073729),
('reminded', 0.23090555664969903),
('charles', 0.23084859768861493),
('opinion', 0.23017757797158969),
('violent', 0.23001643060197177),
('cast', 0.22984669865679985),
('showed', 0.22957444164450017),
('husband', 0.22884157242884745),
('along', 0.22846608320530892),
('thomas', 0.22843923888892012),
('emotion', 0.22767870647960098),
('final', 0.22687173729661503),
('asks', 0.22587953113308443),
('once', 0.22584782184384555),
('play', 0.22515360224223399),
('angel', 0.22508341613203642),
('honest', 0.22501096548900512),
('leads', 0.2249437318183577),
('focus', 0.2249118983709518),
('moral', 0.22436979303745413),
('mark', 0.22384017272865772),
('note', 0.22377031410501885),
('wedding', 0.22314355131420976),
('costumes', 0.22206188544346864),
('presented', 0.22206188544346864),
('returns', 0.22168263067453486),
('heroes', 0.22032267497256833),
('legend', 0.22015401246584368),
('killers', 0.21950056003570861),
('clever', 0.21808878065258569),
('bob', 0.21808878065258569),
('leader', 0.21772348384487053),
('inside', 0.21715550946958723),
('partner', 0.21667103680859229),
('scott', 0.21622310846963599),
('hollywood', 0.21583133569283269),
('most', 0.21576949086149472),
('powers', 0.21465693743689096),
('viewer', 0.21459649073575129),
('everybody', 0.21440987134545511),
('films', 0.21413624545873913),
('technology', 0.21217451994363576),
('smart', 0.21209371512762482),
('lover', 0.21184399606027632),
('steve', 0.21108962697039713),
('elements', 0.20995150453662395),
('gold', 0.20951729960244461),
('suicide', 0.20895891632225325),
('learn', 0.20879968206178118),
('feature', 0.20781540562017428),
('several', 0.20763936477824455),
('include', 0.20763936477824455),
('children', 0.20736236925456764),
('thus', 0.20697069206860877),
('members', 0.20688303044242937),
('loss', 0.20661424936299921),
('against', 0.20641312305500026),
('star', 0.2061007485600686),
('o', 0.20536995223024557),
('round', 0.20479441264601322),
('event', 0.20391218938632222),
('unknown', 0.20278262866529465),
('caused', 0.2023684887392454),
('showing', 0.20171209102619739),
('actresses', 0.20121819068524419),
('moment', 0.20012618142746136),
('find', 0.19954652319879115),
('died', 0.19948936041632001),
('english', 0.19944143900742142),
('copy', 0.19822014286175296),
('americans', 0.19735943415849519),
('year', 0.19707910617291846),
('everyone', 0.19667468614876915),
('commentary', 0.19497267434751336),
('days', 0.19446394361968003),
('changed', 0.1944245789651983),
('leaves', 0.19358474907266526),
('code', 0.1930660960769319),
('named', 0.19285520124940034),
('proved', 0.19259310711578442),
('home', 0.19228354952270224),
('historical', 0.19129022677671509),
('knows', 0.19126834278376073),
('career', 0.19105523676270922),
('sets', 0.19049188820406651),
('friends', 0.1903044054499502),
('books', 0.19029550263309888),
('has', 0.18887466019234553),
('she', 0.18812315883348638),
('aware', 0.18759861389479834),
('and', 0.18744824888788403),
('within', 0.18540322333136275),
('imagination', 0.18540322333136275),
('interested', 0.18457127652797004),
('developed', 0.18322694390876712),
('grown', 0.18232155679395459),
('based', 0.18206550763111667),
('show', 0.18127315550973783),
('willing', 0.18117935221517753),
('while', 0.18101161831162818),
('eventually', 0.18079600348511757),
('won', 0.17904823144898546),
('power', 0.17768117723745241),
('child', 0.17706493935013418),
('gary', 0.17693070815907824),
('aspects', 0.17589066646366419),
('race', 0.17531101230773724),
('subject', 0.17509385367250144),
('west', 0.17233299891044881),
('teenage', 0.17220342662836996),
('j', 0.17095779814363948),
('manner', 0.17072980863016957),
('london', 0.16960278438617996),
('continue', 0.16923392203858686),
('sudden', 0.16907633004393391),
('entertainment', 0.16857992894165819),
('haven', 0.16829735046773628),
('though', 0.16811984981797709),
('real', 0.16793949493155988),
('affair', 0.16753776060971765),
('mystery', 0.16725130387295642),
('course', 0.16712103026988581),
('graphic', 0.16705408466316624),
('above', 0.16623541904233033),
('version', 0.16534322025953679),
('pieces', 0.16507975035944861),
('artistic', 0.16507975035944861),
('parts', 0.16480151437379129),
('finale', 0.16454938704815697),
('feel', 0.16432541490121119),
('chosen', 0.16345307248957175),
('two', 0.16221957782822941),
('short', 0.16211092086564666),
('ones', 0.16194799028728998),
('class', 0.16142343915633808),
('he', 0.16126968303520792),
('soldier', 0.16126814759612232),
('miles', 0.16126814759612232),
('feeling', 0.16090472414323376),
('compare', 0.15984870094189604),
('roll', 0.15963014559188393),
('following', 0.15906469462968728),
('limited', 0.15822400521489419),
('morning', 0.15762894420358306),
('towards', 0.15706189003471671),
('party', 0.15639787178416298),
('slightly', 0.15529288440603525),
('starting', 0.15524059819128397),
('noticed', 0.15415067982725836),
('again', 0.15399622774570954),
('turns', 0.15365624223992966),
('considered', 0.15287435546765354),
('rise', 0.1528392042294475),
('anthony', 0.15180601286800413),
('presence', 0.15113862935726671),
('secret', 0.15061085312213432),
('particularly', 0.1504220077889464),
('canadian', 0.15006069457573326),
('until', 0.14997094542455039),
('christopher', 0.14967639943233715),
('fear', 0.14842000511827322),
('image', 0.14763599880606468),
('uk', 0.14745273114313062),
('comedy', 0.14624098022947199),
('their', 0.14569896839951851),
('work', 0.14566166450234702),
('due', 0.14531009181713533),
('adaptation', 0.14531009181713533),
('la', 0.14518200984449789),
('ahead', 0.14489135441446163),
('ball', 0.14458122881110749),
('reaction', 0.14424960884454671),
('lady', 0.14394650965301667),
('band', 0.14364270222884329),
('al', 0.14348172347769078),
('worth', 0.14328986169246002),
('ben', 0.14263615961744186),
('novel', 0.1422015630045293),
('background', 0.14217448878054254),
('hope', 0.14111769970861912),
('wanting', 0.14069988210313511),
('who', 0.13963110655212385),
('mysterious', 0.13815033848081718),
('date', 0.13783247452391326),
('in', 0.13773074964958743),
('stand', 0.13747099062860288),
('trust', 0.13685918271719735),
('saw', 0.1365755350057507),
('despite', 0.13647516866853593),
('sexy', 0.13473259397015666),
('develop', 0.13459015397276475),
('finally', 0.13422849965867853),
('mood', 0.13415001312058869),
('came', 0.13401143103681315),
('is', 0.13363870656907434),
('efforts', 0.13353139262452257),
('process', 0.13264447460629469),
('action', 0.13249384115269827),
('brief', 0.13150230839774157),
('technical', 0.13090557054685498),
('reach', 0.12921173148000625),
('by', 0.12853075861071547),
('land', 0.12689175185462376),
('extra', 0.12675170563914381),
('voice', 0.12635626636687172),
('must', 0.12628259646883916),
('toward', 0.12604072089536497),
('mid', 0.12516314295400599),
('scared', 0.12475170497247684),
('included', 0.12470347850095724),
('author', 0.12379421734666433),
('disturbing', 0.12361395596717663),
('easily', 0.12333640177293873),
('aspect', 0.12296171113483492),
('largely', 0.12296171113483492),
('moore', 0.12296171113483492),
('fit', 0.12260232209233228),
('held', 0.12260232209233228),
('playing', 0.12255217541328564),
('passed', 0.12160713209478698),
('matter', 0.12071374740446419),
('other', 0.12061743163780955),
('road', 0.11940773138338527),
('forgotten', 0.11844815041319409),
('choice', 0.11689375147149943),
('questions', 0.11679926774625068),
('vote', 0.11672427430814139),
('seeing', 0.11632145801691783),
('those', 0.11592359847967949),
('planet', 0.11543616764656794),
('bring', 0.11506932978478729),
('introduced', 0.11477551459242825),
('four', 0.11415890536299227),
('twice', 0.11352382629698717),
('european', 0.11279549414534429),
('force', 0.11274062295544887),
('mainly', 0.11179140598811663),
('becomes', 0.11162917008200528),
('directed', 0.11131812921379115),
('coming', 0.11112078583096703),
('fan', 0.1109924669366235),
('more', 0.11097538681569005),
('constantly', 0.11042381761437305),
('hold', 0.11000089521432849),
('during', 0.1099181377350741),
('police', 0.10939859440710656),
('singer', 0.10919929196499201),
('protagonist', 0.10889408823913739),
('murders', 0.10880285984879917),
('beginning', 0.10852293076233929),
('immediately', 0.10809649547670099),
('negative', 0.10775286129644605),
('of', 0.10766910804438501),
('villains', 0.10763066419236536),
('post', 0.10754154160418654),
('with', 0.10743222230268429),
('telling', 0.10742024862083691),
('upon', 0.10707894004043897),
('area', 0.10660973505825827),
('loses', 0.10616019582839073),
('safe', 0.10536051565782635),
('eyes', 0.10518719041257044),
('sequences', 0.1044905721492862),
('large', 0.10422350587075116),
('respect', 0.10409389104263336),
('green', 0.10379679368164356),
('last', 0.10370566043971788),
('when', 0.1028897983398213),
('treatment', 0.10178269430994238),
('dog', 0.10168404092851775),
('serial', 0.10161017641078243),
('from', 0.10106647591230367),
('company', 0.10086061033497386),
('see', 0.099917524764184903),
('needs', 0.099845334969716121),
('strange', 0.099219247635743829),
('design', 0.099206650083448103),
('numbers', 0.099090902644230969),
('first', 0.098955089925380019),
('six', 0.098781544559783718),
('teenager', 0.098734840685689051),
('perhaps', 0.098596115569284243),
('cross', 0.098238439583413259),
('lot', 0.097564755867877814),
('stars', 0.096659404205090227),
('listen', 0.096460266187562316),
('meanwhile', 0.096073830089622239),
('week', 0.095725203674154269),
('an', 0.095336742933817303),
('hits', 0.095310179804324935),
('results', 0.094615976374849128),
('place', 0.094597844501735473),
('wall', 0.093218128832100788),
('screen', 0.093090423066012035),
('officer', 0.092592786827824888),
('three', 0.092400179829382326),
('mix', 0.092206193866733843),
('fred', 0.092170459799657101),
('missed', 0.091937495325685639),
('help', 0.089675228287008593),
('since', 0.089468159841429057),
('convincing', 0.088947486016496116),
('changes', 0.087911872322879892),
('water', 0.087647307058755675),
('big', 0.087462267506023539),
('whether', 0.086401434915215417),
('depth', 0.085990447855522512),
('cute', 0.085815919584856126),
('problems', 0.085637885161817195),
('film', 0.085618555650856729),
('spanish', 0.08515780834030677),
('government', 0.084956722475965391),
('mission', 0.083699018876646838),
('wish', 0.083555885690973566),
('falling', 0.083381608939051),
('issue', 0.083381608939051),
('wants', 0.082280679513991054),
('situations', 0.082013151660835074),
('which', 0.080721093142199482),
('set', 0.080695491821007242),
('rate', 0.079787116617831902),
('captain', 0.079552631701953688),
('kate', 0.079512062927733607),
('extras', 0.079336742236521013),
('lost', 0.078598068066187896),
('revealed', 0.077558234345874444),
('agent', 0.077386663615420195),
('part', 0.076751339578802424),
('satire', 0.076372978784573956),
('chris', 0.0758657507747134),
('little', 0.07583626238355369),
('my', 0.075531730165573088),
('majority', 0.07534943724178679),
('keep', 0.074941421292118657),
('woody', 0.074723546195936574),
('public', 0.074370542721069938),
('dad', 0.073203404023294921),
('player', 0.073122264828962585),
('ex', 0.072526444068262585),
('suffering', 0.071743904858841315),
('consider', 0.070067562616716844),
('devil', 0.069869679960486),
('decides', 0.069497794496921506),
('s', 0.069425993723984988),
('close', 0.069418765871583257),
('faces', 0.069391993423999793),
('difference', 0.068992871486951421),
('actress', 0.068879678889626456),
('witch', 0.068137805167218041),
('took', 0.067236944784686545),
('soft', 0.06713930283762852),
('hitler', 0.066691374498672143),
('numerous', 0.066445099408152755),
('does', 0.066343124451621341),
('creepy', 0.065751377562780433),
('hat', 0.065751377562780433),
('room', 0.065632014958942553),
('provide', 0.065382759262851711),
('martin', 0.065240521868400944),
('odd', 0.065203193810762075),
('death', 0.065012406669236716),
('tim', 0.064538521137571164),
('broken', 0.064538521137571164),
('effect', 0.063112423310056259),
('anderson', 0.062800901239030441),
('runs', 0.062276929521514507),
('station', 0.062131781107006179),
('create', 0.062010074784212534),
('ultimately', 0.061321890874318948),
('heavy', 0.060870715087025913),
('cage', 0.06062462181643484),
('filmed', 0.060148846690498754),
('food', 0.059898141581069014),
('the', 0.05902269426102881),
('woman', 0.058672047052498608),
('sci', 0.057679111586677989),
('now', 0.056463107132025653),
('till', 0.056089466651043578),
('second', 0.05605134890532238),
('flying', 0.055880458394456628),
('knew', 0.05563169598614863),
('robot', 0.055059777183027389),
('taking', 0.054406722207164263),
('a', 0.053580080386005403),
('type', 0.053298581724361922),
('ups', 0.052446475372542524),
('themselves', 0.052332615458067437),
('hearing', 0.051735674399188893),
('aged', 0.051293294387550481),
...]
In [23]:
list(reversed(pos_neg_ratios.most_common()))[0:30]
Out[23]:
[('boll', -4.0778152602708904),
('uwe', -3.9218753018711578),
('seagal', -3.3202501058581921),
('unwatchable', -3.0269848170580955),
('mst', -2.7753833211707968),
('incoherent', -2.7641396677532537),
('unfunny', -2.5545257844967644),
('waste', -2.4907515123361046),
('blah', -2.4475792789485005),
('horrid', -2.3715779644809971),
('pointless', -2.3451073877136341),
('atrocious', -2.3187369339642556),
('redeeming', -2.2667790015910296),
('prom', -2.2601040980178784),
('drivel', -2.2476029585766928),
('lousy', -2.2118080125207054),
('worst', -2.1930856334332267),
('laughable', -2.172468615469592),
('awful', -2.1385076866397488),
('poorly', -2.1326133844207011),
('wasting', -2.1178155545614512),
('remotely', -2.111046881095167),
('existent', -2.0024805005437076),
('boredom', -1.9241486572738005),
('miserably', -1.9216610938019989),
('sucks', -1.9166645809588516),
('uninspired', -1.9131499212248517),
('lame', -1.9117232884159072),
('insult', -1.9085323769376259),
('uninteresting', -1.8782515005814986)]
In [ ]:
Content source: swirlingsand/deep-learning-foundations
Similar notebooks: